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There are both practical and foundational motivations to consider the thermodynamics of quan¬ 
tum systems at small scales. Here we address the issue of autonomous quantum thermal machines 
that are tailored to achieve some specific thermodynamic primitive, such as work extraction in the 
presence of a thermal environment, while having minimal or no control from the macroscopic regime. 

Beyond experimental implementations, this provides an arena in which to address certain founda¬ 
tional aspects such as the role of coherence in thermodynamics, the use of clock degrees of freedom 
and the simulation of local time-dependent Hamiltonians in a particular quantum subsystem. For 
small-scale systems additional issues arise. Firstly, it is not clear to what degree genuine ordered 
thermodynamic work has been extracted, and secondly non-trivial back-actions on the thermal ma¬ 
chine must be accounted for. We find that both these aspects can be resolved through a judicious 
choice of quantum measurements that magnify thermodynamic properties up the ladder of length- 
scales, while simultaneously stabilizing the quantum thermal machine. Within this framework we 
show that thermodynamic reversibility is obtained in a particular Zeno limit, and finally illustrate 
these concepts with a concrete example involving spin-systems. 

PACS numbers: 03.67.-a, 05.70.Ln 


I. INTRODUCTION 

The issue of work in arbitrary-scale quantum systems 
turns out to be quite subtle, and a good deal of recent 
studies [msi have analysed varying notions of work for 
finite-sized quantum systems. More recently the role that 
quantum-mechanical properties, such as coherence, play 
in work extraction are being addressed using resource- 
theoretic formulations. For example, it has been pointed 
out in [23] that free energies do not constitute proper co¬ 
herence measures, and so to properly quantify the ther¬ 
modynamic value of quantum coherence it is necessary 
to develop measures that go beyond free energies. In 
subsequent analysis EH EEL general upper and lower- 
bounds have been developed to constrain such coherent 
transformations under very general thermodynamic op¬ 
erations. Moreover, in El E7] the question of work ex¬ 
traction from an arbitrary qubit state has been analysed 
in a context which explicitly models the coherence re¬ 
sources that are required to extract work from the qubit 
state. The analysis recovers the expected result that one 
can indeed associate the free energy difference A F to 
an arbitrary pure qubit state \ip) (ip |, but only within a 
particular “classical regime”, in which one has access to 
an infinite system with unbounded coherence resources. 
Outside of this setting it is provably impossible to extract 
all of the free energy from the quantum coherence. 

The actual implementation of these thermodynamic 
processes often assume a complex protocol. A great deal 
of control is required over the different components in 
order to magnify the energy acquisition up to scales in 
which the notion of ordered, robust energy makes more 
sense. The central aim of this paper to address scenarios 
in which such thermodynamic processes are carried out 
on a quantum system S via a quasi-autonomous thermal 
machine M that is comparable in scale and itself displays 


quantum-mechanical properties. This sheds light on the 
physical characteristics demanded of a quantum thermal 
machine. 

In the case of work extraction, a finite-sized quantum 
machine will absorb energy which, due to its finite dimen¬ 
sion, will diminish its ability to function. We show that 
a resolution to this is to perform an “energy harvesting 
measurement” on the machine. This harvesting measure¬ 
ment serves a dual function of siphoning off energy from 
the machine while also stabilizing its ability to function as 
a thermal machine at these quantum-mechanical scales. 


A. Clocks and quantum coherence 

As highlighted in (23H233 EZ], in order for thermody¬ 
namic processes to be sensitive to quantum coherence at 
arbitrary scales the thermal machine must itself possess 
coherent properties. In |25j this requirement for quan¬ 
tum coherence was identified with transformations that 
break time-translation symmetry. Any quantification of 
the coherence follows from a quantification of this asym¬ 
metry. 

More explicitly, the breaking of time-translation sym¬ 
metry demands that the effects of an action depend on 
whether it is performed at time 1 1 or at some later time 
t 2 > t\. The way in which one handles time-translation 
asymmetry has been known for a long time - we intro¬ 
duce a “clock” system, sensitive to the passage of time 
EH] • In the classical regime we of course have abundant 
access to time-keeping devices, however this becomes 
a more non-trivial component for extremely small, au¬ 
tonomous quantum devices, or in environments in which 
quantum-mechanical aspects dominate. 

Such considerations have also resulted in an increased 
focus on the role of clocks in thermodynamics [H1EZ1ESK 
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133j . To implement autonomous quantum thermodynamic 
protocols, generic thermal machines invariably require a 
clock degree of freedom, which serves as a non-classical 
time-keeping device. This not only allows access to quan¬ 
tum coherence, but also can be used to induce effective 
time-dependent interactions within systems. 


B. Energy transfer to macroscopic scales via 
quantum measurements 

The use of the clock in, say, a work-extraction pro¬ 
tocol necessitates a non-trivial interaction between the 
quantum system S and the clock. The unavoidable back- 
action experienced by the machine is in general accom¬ 
panied by an energy flow that can either be transferred 
to other degrees of freedom in the machine M, or more 
simply maintained in the clock itself. However if the ma¬ 
chine M is comparable in scale to the quantum system 
S then it is debatable to what extent one has “gained 
work” if it is confined to quantum-mechanical degrees of 
freedom in M. A basic requirement is that the acquired 
energy can be transferred to larger scales in some natural 
manner. 

The passage from the quantum regime to the classical 
regime has long been a topic of controversy and debate. 
Where does quantum end and classical begin? Notions 
that are applicable on the classical side of this divide are 
inapplicable on the quantum side, and quantum measure¬ 
ments play a central role in linking these two regimes. 
Depending on which side of the cut one places the mea¬ 
surement device one can either view a measurement as 
an abrupt transformation of the quantum state (e.g.via a 
projective measurement), or one could equally model the 
measurement process itself as a purely unitary interac¬ 
tion between the quantum system S and some measure¬ 
ment apparatus A. The purpose of the unitary interac¬ 
tion Us a is to magnify the quantum-mechanical aspects 
up the ladder of length-scales to degrees of freedom that 
are deemed classical. The apparatus eventually admits 
a read-off and an objective measurement outcome is ob¬ 
tained. 

One particular set of constraints when considering fully 
quantum mechanical measurement approaches is given 
by conservation laws. The connection between conserva¬ 
tion laws and measurements has a long and subtle history. 
The WAY-theorem states that in the presence of a conser¬ 
vation law there is an effective superselection rule in place 
on the observables that can be measured [B] . In 

m it was shown that the measurement device must carry 
two different resources - a coherence resource to partially 
lift the superselection rule, and a charge degree of free¬ 
dom to balance books. The unitary interaction between 
the machine M and the system S is constantly entangling 
the two systems. This unitary correlation process can be 
viewed as an information-acquisition by the machine in 
an effective local energy basis that varies with time. Note 
that this process of M acquiring information on S is dis- 



FIG. 1. Dynamics of the thermal machine. The quan¬ 
tum thermal machine at point A of its orbit, absorbs energy 
from the heat reservoir, which causes a fraction of the total 
state to deviate from the reference clock orbit. Specifically 
a fraction pi(t) evolves to point E off the reference orbit, 
while the remaining fraction (1 — pi(t)) freely evolves to J3. 
The energy-harvesting measurement projects the component 
at E onto B and allows us to extract this energy as well as 
stabilizing the machine back to the reference orbit. Each tri¬ 
angular section constitutes a single unit protocol (UP). The 
total entropy generated corresponds to the integrated “area” 
under the jagged curve, while the extracted work corresponds 
to the integrated “length” of the jagged curve. The reference 
orbit x(t) is the shaded great circle with respect to which the 
protocol is defined. 

tinct from the external measurement performed on M we 
will later consider. 

In what follows, we view the operation of the ther¬ 
mal machine M with the system S and reservoir R as 
transforming an energetic degree of freedom of the de¬ 
vice under p s <g> p M ® 7 R ^ U S m(ps 0 Pm 0 7h)^sm> 
which can then read-off via a subsequent measurement 
on M. This disturbing measurement transfers the en¬ 
ergy acquired by the quantum thermal machine into the 
macroscopic regime, where it can be ascribed a less am¬ 
biguous status. 

In what follows we use thermodynamic work extrac¬ 
tion from a qubit system as our focus. The work ex¬ 
traction involves a quasi-autonomous thermal machine 
based on a globally time-independent Hamiltonian, which 
performs the protocol on the quantum side of the cut, 
together with a continual flow of energy across the cut 
via a classically-controlled measurement process that also 
serves to stabilize the quantum device. 


II. AUTONOMOUS QUANTUM MACHINES: 
THE BASIC CONSTITUENTS 

The traditional Szilard argument begins with the 
knowledge that the system is in one particular state of a 
pair of energetically degenerate levels (e.g. a particle is on 
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the left-hand side of a piston) PJJ RTil - fiTj . Conditioned 
on this knowledge, the agent applies an adapted protocol 
that extracts a certain quantity of work. Schematically 
the unoccupied energy level is elevated by switching on 
a time-dependent Hamiltonian (e.g. attaching weights, 
or tuning magnetic fields). The system is then placed 
in equilibrium with a thermal reservoir at some inverse 
temperature /3, and the Hamiltonian is quasi-statically 
switched off in a classically controlled manner. 

In the fully quantum-mechanical scenario our starting 
point is again knowing that the qubit system S is in a 
definite pure state \ip) (ip\. We next feed this state into 
a thermal machine M, and the composite evolves under 
the joint Hamiltonian Hsm, together with system-bath 
couplings. Crucially, note that we do not assume clas¬ 
sical control over the interaction between the quantum 
systems. Therefore, to obtain a non-trivial extraction of 
energy requires us to induce a particular time-dependent 
evolution of the system S - therefore the machine M 
must possess a clock degree of freedom that induces an 
effective time-varying Hamiltonian on the target system. 
In |31| the authors also consider quantum systems that 
via interactions with their environment act as clocks, and 
study their synchronisation as well as the back-action 
they suffer due to the interaction, although not in a ther¬ 
modynamic setting. The clock in pTTj comprises two com¬ 
ponents, a clockwork which evolves due to an internal 
mechanism, and tick registers which briefly interact with 
the clockwork and extract time information. The notion 
of a clockwork in m is similar to what we call the clock 
itself, while the system S can loosely be seen as playing 
the role of the tick registers. The clock in the present 
study however is in continuous contact with the system 
and evolves with it under a joint unitary. 


A. Induced local time-dependent level splittings 

The known state | ip) (ip | defines an orthonormal basis 
{\ip ), | ip)} in the qubit system. Given this preferred de¬ 
composition, the generic interaction Hamiltonian needed 
on the joint Hilbert space Hsm takes the form 

Hsm = cr <8> Hi + 1 <8> Hp (1) 

where a := \ip) (ip\ — \ ip) (ip\ (see appendix |b|. In particu¬ 
lar, the term a® Hi is what will generate a level-splitting 
local to the qubit, with the time-dependence being en¬ 
coded in the thermal machine’s quantum state pm- The 
second term 1 ® Hp, as we shall see, generates an evolu¬ 
tion on the system M which is not sensitive to the state 
of the qubit, and so can be interpreted as the free Hamil¬ 
tonian of the machine. Crucially, this joint Hamiltonian 
is time-independent , fixed for all eternity. 

In the absence of bath couplings, the system-machine 
composite evolves under this Hamiltonian as psm —t 
e -itH SM p SM e ltH sM _ The function of the interaction with 
the machine is to induce an effective Hamiltonian that is 


local to the qubit. We define 

Hs(t) := tr M [l ® Pm^Hsm], (2) 

which is a time-dependent mean-field Hamiltonian on the 
qubit. This mean-field approximation turns out to be the 
physically appropriate choice in the context considered, 
and has been analysed in more detail in mm- For the 
above interaction Hamiltonian this takes the form 

H s (t) = (H I )a + (H F ) 1. (3) 

This mean-field Hamiltonian defines a local effective ba¬ 
sis., and encodes the non-correlative (and therefore non¬ 
entropy increasing) dynamics local to the system [451 - 
m- Crucially, the thermal bath is assumed to only see 
this local Hamiltonian, which encodes the statement that 
the machine M does not undergo direct thermalization. 
This assumption is equivalent to demanding a large cou¬ 
pling strength between system and bath relative to the 
coupling to the machine, such that the thermalisation 
time-scale of the system is much shorter than that of the 
machine. The exact degree to which thermalization of S 
occurs will depend on the particular bath coupling rates 
(see also appendix |f|. 

Since the energy exchanges with the bath only de¬ 
pend on the level-splitting of H$ it suffices to assume 
Hs(t ) = (Hi) a. Also, note that instead of time being an 
explicit parameter in the system Hamiltonian tuned by 
an external agent, the time-dependence is now induced 
by the dynamics and the particular quantum state of 
the machine M , giving the machine an inbuilt quantum 
clock. The time-dependence is explicitly a function of 
the coherence properties of the state pM with respect to 
the Hamiltonian Hsm , as emphasized previously 
Therefore, the joint Hamiltonian is essentially deter¬ 
mined for the (known) quantum state \ip) (ip | and, to¬ 
gether with an initial joint state 

P(0) = \ip) (ip\ ® Pm(0), (4) 

induces a specific time-dependent level-splitting local to 
the qubit. However one must also address the back-action 
of the dynamics on the machine itself. The joint Hamil¬ 
tonian can be written in the alternative form 

Hsm = \ip) (M ® H_ + \ip) (ip\ <g> H + ( 5 ) 

where we have defined H± := Hp ± Hp The evolution 
that is generated by this Hamiltonian splits into two parts 

U(t) = | ip) (ip\ <g> U-(t) + | ip) (ip\ ® U+(t) (6) 

where U±(t) = e ~ lH±t . This describes a controlled 
unitary action, in which conditioning on the qubit’s |t/>) 
and |^) states evolves the clock along two independent 
orbits according to U- and E/+ respectively. 


1 We use the terms clock and machine interchangeably, for the 
discussion here the precise distinctions between the clock degree 
of freedom and the rest of the machine is not important. 
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B. Designing a good machine: core requirements 

We now turn to the question of what clock charac¬ 
teristics besides the validity of eq. © are desirable for 
the functioning of the thermal machine. To make this a 
well-posed problem, we demand that the clock’s Hilbert 
space dimension is fixed and that the Hamiltonian has 
its spectrum upper bounded by some fixed energy-scale 
\\H S m\\ < E, but we are otherwise free in designing the 
machine’s Hamiltonians and its starting state pm{ 0). 

Firstly, the Szilard argument requires an initial degen¬ 
eracy in the energies of the qubit. Secondly, it is de¬ 
sirable (but not essential) to fix the energy of the state 
\ip) to be zero, so that the qubit’s induced level-splitting 
according to eq. © is given by A (t) := tr [p M (t)H + ]. 
Finally and most importantly we require the right co¬ 
herence properties of the machine to ensure that it func¬ 
tions well, both as a clock and in its ability to induce 
level-splittings on the qubit. These three criteria are re¬ 
spectively encoded by the following set of conditions on 
the operators {pm(0), H-, H + }: 

(i) tr[p M (0)-ff/] = 0, 

(ii) tr [p M (0)H_] = 0, 

(iii) Im (tr[p M (0)[#_,#+]]) > 0. 

Conditions |i]) and (|n| follow directly and uniquely from 
the desire for initial degeneracy and fixing of the ground 
state. The intuition behind ( [TiT[ ) is less straight forward 
and deserves some elaboration. We would like the Hamil¬ 
tonians H + and H_ to have a large commutator in an 
operator norm sense. At the level of the algebraic re¬ 
lations this implies a rapidly changing unitary evolution 
(as can be seen from expanding the unitary in increasing 
orders of commutators). But we then require the state 
to have a strong response to the induced dynamics. This 
can be achieved by having the initial state “mutually 
unbiased” with respect to the Hamiltonians. Note that 
every Hilbert space T~L admits a triple of mutually unbi¬ 
ased bases. An extreme regime is the case of H± being 
built from two of these bases, while the quantum state is 
one of the basis states in the third basis. This guaran¬ 
tees that the state simultaneously has maximal coherence 
with respect to both bases and therefore will strongly 
break time-translation invariance. Note that taking the 
imaginary part in Jni| ) is due to the fact that the trace 
will be imaginary due to the anti-Hermitian nature of 
the commutator. While not being unique, condition ( fmj ) 
provides a convenient encapsulation of these physical re¬ 
quirements. 

From eq. ([5]) we see that in the absence of any thermal 
contact, the qubit remains in the state \ip) for all time as 
the joint system freely evolves under H$m- This defines 
a reference trajectory for the clock, whose dynamics are 
fully determined by [/_. We therefore define \{t) := 
U-[t)pM (0)C/l(f) as the clock state at time t on the ideal 
reference clock-orbit. Clearly, for any given Hamiltonian 
Hsm , there is a range of states {Pfr[\o)}m that satisfy 


conditions |j]) and © and ensure validity of eq. ([2]) . Each 
of these initial clock states has its own unique clock-orbit 
associated with it. We choose the specific state 
Pm{ 0) as the state whose associated orbit y(f) maximises 
the level splitting A (t) for some t = t over the orbit, i.e. 

X(f) := argmax (maxtr[x (m) (f)-ff+]) . (7) 

This maximising state can always be taken to be a pure 
state which we call | d), where d refers to the Hilbert 
space dimension of the clock. The gap maximization then 
requires that t/_ rotates this pure state |d) ( d\ (which 
obeys conditions ([i]) and (JnJ)) into the maximum eigen¬ 
value eigenstate of H + . To achieve this, it is sufficient 
to design the machine’s Hamiltonians such that H + and 
H_ are generators of SU(2) on the d-dimensional machine 
(see appendix [A| for details). Assuming this choice, eq. 
([7]) is equivalent to optimising for condition ([hi]), so that 
the optimal machine starting state is the maximum eigen¬ 
value eigenstate of the operator C := i[H_,H + ] (which 
is also an SU(2) generator). We define the eigenbasis of 
C with ascending eigenvalues as {|m)}i< m <d, such that 
Pm{ 0) = | d) (d|. Finally, we introduce the complete ro¬ 
tating clock basis {| 771 (f)) := U-(t) |m)} which co-rotates 
with the clock’s m = d reference orbit y(f) = \d(t)) (d(t) |. 

Choosing id_ to be an SU(2) generator on the sys¬ 
tem Hilbert space 7i also has the advantage of ensuring 
closed, periodic orbits and will allow us to run the en¬ 
gine in a well-defined cycltj^J We call the period of the 
clock r, such that r is the smallest positive number for 
which [/_ (r) = 1. The d different orbits swept out by the 
{|m(t))} clock basis states over one clock period form a 
zero-energy surface with respect to the qubit’s | if) state. 
Another important property is that, for all t = nr with 
n £ INI, the orthogonal states | ni(nT)) have equal energy 
with respect to any qubit state. This follows since the 
entire eigenbasis of C satisfies condition ([!]), and so we 
may switch between clock orbits without any energy cost 
at these times. 


III. THE EXPLICIT PROTOCOL 

Having established the basic quantum-mechanical 
properties required of the engine, we now turn to the 
details of the actual engine protocol. First there is the 
initialization phase in which the initial state Q freely 
evolves for a time f at which point it attains a maximal 


2 If we do not choose such an H _, we have to impose an additional 
condition, demanding the ratio of the eigenvalues of H— to be 
rational. Note that any real number can be approximated by a 
rational number with arbitrary accuracy, and thus this restriction 
could be considered as unnecessarily strict since any clock will 
always be periodic to arbitrary accuracy. However, its period 
might approach infinity in these cases, so the restriction makes 
sense from a practical view point. 
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local level-splitting A max := A(f). This induces a local 
raising of the system’s unoccupied | ip) state, as in the 
original Szilard-type protocol. However the interaction 
Hamiltonian H$m is fundamentally time-independent, 
and so there is a need for a single timing-switch within 
the machine that initialises thermal contact between the 
qubit and the thermal bath at time t = t. It is not en¬ 
tirely clear that such a single time switching is fundamen¬ 
tally necessary, but in approaches such as the present one 
it appears to be almost impossible to avoid if one wants 
to keep the model general. 

With thermal contact in place, work extraction then 
takes place for t < t < r and can be analysed in steps 
of duration dt , which can be viewed as constituting unit 
protocols (UP). Each UP can be further analysed in terms 
of three sub-components: 

(a) Thermalisation of the system S. 

(b) Global dynamical evolution of system and machine. 

(c) Harvesting measurement on the machine M. 

It is important to note that the division into these units 
is determined by the macroscopically-controlled timing 
of measurements, and not at the level of the quantum 
machine. 

A. Autonomy of the Thermal Machine 

One might argue that the frequent measurements in a 
(as shall be seen later) time-dependent basis in step Q 
are in fact similar to a non-autonomous machine having 
a fine-tuned time-dependent Hamiltonian which is exter¬ 
nally controlled. Note though that these two scenarios 
only become comparable in the limit dt —» 0 where the 
measurements occur at a very high rate. However, our 
model is valid for arbitrary dt, and one can even con¬ 
sider the extreme case in which dt = r — f and only a 
single measurement in a fixed basis is performed at the 
end of the protocol Despite a reduced work output and 
higher probability of failure (see below), the thermal ma¬ 
chine is still able to extract work from the system, even 
though the process in this limit becomes comparable to 
the standard two-point work measurement [351 323 em¬ 
ployed in many quantum thermodynamic protocols, but 
with a fixed Hamiltonian. In the non-autonomous sce¬ 
nario if the Hamiltonian was fixed no work output would 
be possible in this case. Since the machine studied here is 
able to extract work for any numbers of interventions we 
call it quasi-autonomous. It abstractly coincides with the 
fully non-autonomous case only in the limit of continuous 
intervention dt —> 0. 


3 In this case one has to consider more realistic thermalisation 
protocols such as the ones discussed in appendix [F] The choice 
of a single thermalisation per dt, as employed in the main text, is 
only necessary for obtaining the analytical expressions, but not 
a fundamental property of the model. 


B. Dynamics of System & Thermal Machine 

The exact thermalization process can be modelled in 
various ways, including non-trivial interaction with the 
unitary dynamics generated by Hsm- However, for the 
sake of analysis we may approximate steps 0 and 0 as 
firstly a thermalization of the qubit with respect to the 
local mean-field Hamiltonian Hs(t), followed by unitary 
dynamics exp [—iHsividt]. This approximation is robust 
over a large range of parameters, and exact in the dt — > 0 
limit (see appendix [F] and discussion in [H]). 

As such, at the beginning of every UP the joint system 
is to good approximation in a state p(t) = y(t) g) x(t), 
where 7 (f) = po(t) \ ip) (ip\ + pi(t) \ip) (ip\ is the qubit’s 
Gibbs state with respect to its local, mean-field Hamilto¬ 
nian Hs(t ) (eq. |3|) at inverse temperature /3, such that 
p 0 (t) = (1 + e - ^*)) -1 and pi(t) = l-p 0 (t). 

The evolution under the Hamiltonian Hsm takes the 
joint system from p(t) to p'(t + dt) := U (dt) p(t)W (dt). 
However, since the qubit is generically in a mixed state, 
the clock now deviates from the reference clock orbitJ^J i.e. 
tr s[p'(t + dt)] 7 ^ X'(t + dt), as schematically illustrated in 
Fig- 0 Crucially, this deviation of the clock from its ref¬ 
erence orbit corresponds to energy being transferred from 
the qubit system into the machine. Since the state of the 
clock is distorted by this gain it therefore not only acts 
as a time-keeping device but also as temporary battery. 


C. Energy-harvesting and clock stabilization 

Since the quantum thermal machine suffers a back- 
action as it absorbs energy its ability to induce local- 
level splittings and to function as a clock is affected. A 
crucial component of the protocol is that we repeatedly 
perform energy-harvesting measurements on the machine 
that serve two functions: firstly to transfer the energy 
gain from the quantum to the classical regime, and sec¬ 
ondly to stabilize the clock/machine system. This in turn 
allows us to separate the concepts of clock and battery 
in the quantum-mechanical system. 

The target state on the reference clock orbit is given 
by x(t + dt), and therefore the measurement we perform 
is the projective rank-1 measurement in the orthonormal 
clock basis {| m(t. + dt))}. The ability to perform this 
measurement is assumed to be a free operation that is ac¬ 
cessible macroscopically, however this too could be mod¬ 
elled more explicitly using a larger coherent reference, if 
one wished. When the measurement is performed, with 


4 As an illustrative analogy, one can think of the state y(t) as 
the hand of a clock and dt as the fundamental unit of time, a 
’second’. If the qubit has a non-zero \ip) component, instead of 
simply ticking to the next position y(t + dt), the hand of the 
clock splits up into two parts ending up in a convex mixture of 
the expected \(t + dt) state, and the state U—(dt)x(t)U(_ (dt). 
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high probability we project back onto the reference or¬ 
bit x(t + dt), thus stabilizing the clock. This probability 
tends to one as we either decrease the thermal couplings 
or increase the rate dt~ 1 at which we perform the mea¬ 
surements (see appendices [B] and 0- 

The measurement performed during each UP does not 
commute with H$m and is thus not energy-conserving. 
It therefore leads to energy flows between the joint sys¬ 
tem and the external measurement device. Since energy 
is globally conserved we can explicitly compute the en¬ 
ergy flow into the measurement device. For the out¬ 
come corresponding to the projection II m (t + dt) := 
1 <8> | m(t + dt)) (m(t + dt )| we find this to be 


dE m (t,dt) = tr H SM {p'(t + dt) - p m (t + dt)) , (8) 


where p m (t + dt) oc II m {t + dt)p'{t + dt)H m (t + dt) is 
the post-measurement state of the system and machine 
conditioned on outcome to. 


D. Exorcising Demons: 

Landauer accounting in the macroscopic regime 

While eq. Q provides the exchange of energy between 
the quantum device and measurement apparatus, we can¬ 
not identify this as the extracted work. The reason is that 
the measurement device must be re-set and its memory 
erased in order to avoid any Maxwell’s demon type sce¬ 
narios EM]. However this memory is a classical record 
in the macroscopic apparatus and therefore the tradi¬ 
tional Landauer cost of erasure applies [5MBB1 . 

The information gain by the apparatus is described 
by the distribution of measurement outcomes p(t) := 
{p m (t,dt)}, and so the minimal cost of erasure is given 
by 

dW reset {t,dt) =/3~ 1 S(p(t)), (9) 

where S(-) is the Shannon entropy. We can therefore 
identify an averaged work gain during the UP as 

dW(t, dt) = y dt)dE m (t, dt) - dW reset (t,dt).( 10) 

m 

This provides a basic energy accounting over the unit 
protocol from time t to time t + dt. However in order 
to compose multiple UPs over the entire engine cycle re¬ 
quires us to address the matter of what happens when 
we do not obtain the projection onto the reference clock 
orbit, corresponding to m = d. 


E. Modes of operation and quantum feedback 

We now restrict our focus to two distinct operation 
modes for the engine, which we refer to as the unselective 
and selective protocols. 

In the unselective case a sequence of measurements is 
performed in intervals of length dt, and the outcomes 


recorded. Finally at the end the total measurement data 
is erased and we have a net energy gain which is the 
work output. For this we can define the trajectory m := 
{rrif,mf+dt, as the set of measurement outcomes 

of the N = consecutive UPs of a full cycle. In each 
UP, the energy flow into the measurement device given 
measurement outcome m t +dt, conditioned on starting the 
UP in state labelled by m t is 


dE(m t+ dt\m t ) = tr H S M^p(m t+ dt\mt) 


( 11 ) 


where /\p{m t+ dt\mt) := p{m t ) - p(m. t+dt \m t ) is the dif¬ 
ference between the initial state of the UP and the post¬ 
measurement state. The notation p{mt) shows explicitly 
that the clock does not necessarily start on the reference 
orbit, but on any of the orbits labelled by 1 < mt < d, 
and that the qubit is in a thermal state with respect to 
the local Hamiltonian induced by this orbit. It is im¬ 
portant to note that the energy flow only depends on 
the clock state directly preceding each UP, not the entire 
trajectory, since the thermalisation step essentially kills 
any trajectory history resulting in a Markov process (see 
appendix [B]). 

The reset at the end of of the cycle is given by the 
Landauer expression. If p( m) denotes the probability 
of a certain complete trajectory m, then the reset cost 
is given by W reset = /3 _1 5'({p(m)}). The average work 
output of the unselective engine is 

(W) u = J2p(^)E(m) - r^QKm)}), (12) 


where E{ m) = J2n=i d E { m f+ndt \mf+{n-\)dt) is the en¬ 
ergy flow for the trajectory m. The unselective protocol 
constitutes a minimalist approach, in which the quantum 
components require no feedback control. It is therefore 
the most autonomous mode of operation. 

However in this unselective regime the thermal ma¬ 
chine undergoes non-trivial back-action that degrades the 
clock. One might therefore wish to allow elementary feed¬ 
back control on the quantum systems with the aim of 
maintaining the characteristics of the machine. Feed¬ 
back control has been extensively studied in the con¬ 
text of work extraction protocols, both in the classical 
as well as in the quantum case (see e.g. [I3UI5U64H67]). 
These feedback protocols generally employ measurements 
of the target system, followed by operations which are 
chosen based on the specific result of the measurement. 
Similarly here, in the selective protocol we operate con¬ 
ditional on the measurement outcomes. If to = d we 
have a successful projection onto the reference clock or¬ 
bit, and all is well. The clock is restored back into the 
state x(f + dt), successfully stabilising it, and the joint 
state is p d {t + dt) = p' s [t + dt) <8> x(t + dt). 

For any m ^ d a non-ideal outcome occurs and the 
clock jumps into a different orbit \m(t + dt)). Moreover, 
if one reads off the outcome, then the qubit is collapsed 
into |^), resulting in a joint state p m (t + dt) = \ip) | ® 
\m(t + dt)) (m(t + dt)\ (see appendix [B| for details and 
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the specific form of p' s (t + dt )). The quantum engine has 
“misfired”, and in this case we abort the current engine 
cycle, decouple the qubit from the bath, and perform the 
following feedback process that resets the engine for a 
new cycle. The feedback on the system S flips | ip) into 
| %j)) through a single qubit unitary (whose energy cost 
has to be accounted for), followed by the free evolution 
of the joint system, “running the engine in neutral”, for a 
duration r—(t+dt ) so that the clock ends up in |m(r)) = 
|to( 0)). Crucially, since for t = mr all states of the clock 
basis have equal energy as noted above, we may now 
restore the clock to the reference orbit y(0) without any 
energy cost, ready to begin a new cycle. 


IV. ACTUAL PERFORMANCE 


The cumulative erasure cost in the selective mode is 
generally smaller than the single large erasure in the un- 
selective mode. Starting from the work expression in eq. 
(101 one can compute the average work output of this 
engine mode similar to eq. This average, which 

we shall simply call (W), needs to take into account the 
probabilities of the engine succeeding each UP, as well as 
the cost of the feedback protocol in case the engine mis¬ 
fires. The explicit expression for (W) eq. (CIO) is derived 
in appendix[C] We can also define Wideai as the maximum 
single-shot work output of a cycle that completes without 
a misfire. Although not established explicitly for general 
clocks, the selective engine employing a feedback proto¬ 
col has a higher work output W^eai > (W) > {W) u in 
all examples considered, with equality in the Zeno limit, 
which we shall now consider. 


A. Thermodynamic reversibility and the Zeno 
Limit 


The limiting case of dt —> 0 constitutes a Zeno limit 
(ZL) and takes on a special role, since it allows us to 
recover the well-established results of equilibrium ther¬ 
modynamics. Explicitly evaluating all the quantities in¬ 
volved, it is easy to show (see appendix|D| that the prob¬ 
ability of being projected back into the reference clock 
orbit is equal to unity up to first order in dt. Thus the 
selective engine will complete the entire cycle without a 
single UP failing with probability 1 — 0(dt 2 ), and the un- 
selective engine will follow the reference orbit trajectory 
{d, d, ...,d} with a probability 1 — 0(dt 2 ). It can also be 
shown that in both cases the cost of resetting vanishes 
up to first order, W reset = 0(dt 2 ), making the process 
essentially reversible. This implies that in the ZL both 
engine modes are equivalent. These results further imply 
that eq. (10) reduces to lim^^o dW(t, dt) = dE(t,dt), 
i.e. the entire energy flowing into the measurement de¬ 
vice can actually be identified as work. 


More specifically, we find for the infinitesimal work flow 


dW(t, dt) = — ipi(t)tr x(t)[U_, H + \ dt + 0(dt 2 ), (13) 


where p\(t) is the qubit’s thermal occupation with re¬ 
spect to the local Hamiltonian induced by x{t). 

We can compare this to the change in free energy of 
the qubit. Its partition function is Z(t) = 1 + e -/3A ^). 
Substituting this into the infinitesimal change in free en¬ 
ergy dF(t) = d{—pin Z(t)) we recover the quasi-static 
equilibrium result, 


lim dW(t,dt ) = — dF(t ). (14) 


Specifically, the quantum Zeno engine with a built-in 
clock, constantly stabilised via energy-harvesting mea¬ 
surements, is able to extract the entire free energy differ¬ 
ence of the system as work. Conversely, if we are not able 
to perfectly stabilise the clock at all times and only allow 
the accumulated energy to flow out of the quantum clock 
and into the classical battery at finite intervals, we are 
naturally restricted to AW < —A F. This is in accord 
with the second law, where equality can only be achieved 
under reversible protocols. 

This also demonstrates a core tradeoff between work 
output and power. On the assumption that we are ex¬ 
perimentally restricted to a minimum dt , we can either 
attempt to slow down the system dynamics to get closer 
to the ZL at the expense of power, or vice versa get a 
higher power but being further from equality in eq. (141, 


“wasting” free energy. From eq. (13) we can also see that 
if the qubit’s \tp) state is lowered too fast compared to dt , 
the clock is not able to sample the qubit’s thermal distri¬ 
bution pi(t) quick enough to utilize the full free energy 
difference. This adds to the tradeoff between maximising 
power and maximising work output. 

Integrating eq. (14) over t < t < r, the total work 


output of the Zeno engine is 


Ikzeno = kT(log 2 - log Z(f )) , (15) 


where Z(f) is the partition function of the qubit at maxi¬ 
mum level splitting A(f). This shows a second limitation 
we suffer if considering a realistic finite-sized machine. 
Even if we were able to perfectly stabilise the clock, we 
are further limited by the maximum level-splitting that 
the clock can induce. Only in the limit of an infinitely 
big (i.e. classical) clock can we reach an infinite level 
splitting A(f) -> oo (i.e. Z(t) —► 1) and thus obtain the 
classical result W = kT log 2. 


V. AN EXAMPLE: THE SPIN CLOCK 

The preceding discussion has remained abstract, and 
not tied to a specific physical realisation. It provides 
a broad framework of quasi-autonomous quantum en¬ 
gines driven by measurement-stabilised clocks. However, 
we can look more closely at an explicit example where 
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FIG. 2. Ideal Wideai (solid) and average ( W) (dashed) work 
output for a spin-clock-driven engine against clock size / for 
different stabilisation intervals dt. The red curve shows the 
Zeno limit dt —> 0 which quickly approaches the classical re¬ 
sult W = kT log 2. 


a spin-/ particle (dimension d = 21 + 1) acts as the 
clock m- The joint-Hamiltonian takes the form Hsm = 
(a ® L z + 1 ® L y ) which implies H± = (L y ±L Z )/ -\/2, 
where L & is the angular momentum operator of the spin-/ 
particle along the fc-axis. We notice that as desired H_ 
and H + are also generators of SU(2) and find the third 
generator C = i[H_,H + \ = — L x . As shown in the gen¬ 
eral framework, the optimal initial clock state %(0) is the 
eigenstate of C with maximum eigenvalue, i.e. a spin 
fully polarised along the negative x-direction. The full 
clock-basis comprises the eigenbasis of — L x . 

This example is particularly nice since the spin can be 
viewed as the quant urn-analogue of a clock hand. Under 
free evolution the state simply rotates around an 
axis defined by H_ , just like a clock hand with period 27r. 
The other clock-orbits co-rotate with the reference orbit, 
and can essentially be seen as shortened, fuzzy versions of 
the clock hand, i.e. the spin not being fully polarised in a 
certain direction but only partially. Note that if there is 
no polarisation, for example if the clock gets too mixed, 
the hand disappears, we are unable to tell the time, and 
so unable to induce time-dependence in the qubit. The 
effect of backaction from the qubit on the clock is again 
a stochastic splitting of the clock hand into two parts, 
one following the clock-orbit, the other one rotating out 
of the clock-plane. 

Applying the earlier analysis to this specific model, one 
can explicitly calculate the work output of a spin-clock- 
driven quantum engine for varying spin dimensions d and 
stabilisation intervals dt. The results are shown in Fig¬ 
ures [2] and [H We see that in the ZL even for small clocks 
we quickly saturate the classical result W = kT log 2. 
What might be surprising at first is that for finite dt 
there is an optimal d which is also finite. This comes 
back to the issue of sampling the qubit. If the clock is 
too small, we are unable to raise the | ip) state high enough 
before starting to thermalise. On the other hand, if the 
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FIG. 3. Contours of work output (normalised to kT log 2) of 
the selective engine for clock sizes l and stabilisation interval 
dt. The left plot shows the ideal scenario in which every 
measurement succeeds. The right plot is the average work 
output of the selective engine. 


clock is too large, we raise the state very high, but also 
drop it very fast and hence might not be able to sample 
the thermal distribution quick enough. For larger clocks 
the average quickly falls below the ideal output, but then 
converges again in the l 00 limit since in this case the 
optimal scenario starts to dominate the average due to 
the increasingly smaller deviation from the reference or¬ 
bit experienced by larger clocks. 

Again we see that only in the limit of an infinitely large 
clock with infinitely fast stabilisation can the classical 
limit of W = kT log 2 be recovered. 


VI. OUTLOOK 

We have presented an explicit analysis of the basic re¬ 
quirements of a quasi-autonomous quantum thermal ma- 
chiri(0 Its sole purpose is to extract work from a qubit 
degree of freedom. 

The operation of the thermal machine involves a com¬ 
plex interplay between certain core properties such as 
quantum coherence, clock degrees of freedom, and the 
role of quantum measurements. It would be of theoret¬ 
ical interest to provide more detailed accounts of these 
aspects as they relate to thermodynamics. While we have 
a good framework to understand the stochastic aspects 
of thermodynamics [2], we do not have the same level of 
clarity regarding the role that coherence has on thermo¬ 
dynamic processes. Such a problem can be tackled from 
a variety of directions, including the present approach. 

Beyond this, it would be valuable to make greater con¬ 
nection with current experimental progress related to this 
line of inquiry. Already there has been work in the con¬ 
text of optomechanical systems f68lf7Tj , and our work has 
direct bearing on these models. 


5 Note that while our discussions were focused on a machine oper¬ 
ating on pure qubits, the same derivations generalise to arbitrary 
d-dimensional systems and mixed input states (appendix [G|. 
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From the theoretical perspective it would also be fruit¬ 
ful to further develop the information feedback compo¬ 
nents, and to consider quantum memory degrees of free¬ 
dom at the level of the quantum thermal machine. In 
particular it would be valuable to explore the role that 
measurements play in such systems and to provide a 
fuller account for generic POVM scenarios. Ultimately 
this provides a useful setting to address the interplay be¬ 
tween information and energy flows across the quantum- 
classical divide. 


VII. ACKNOWLEDGMENTS 


We acknowledge useful discussions with Kais Ab- 
delkhalek and David Reeb. We also thank Janet Anders 
for helpful feedback on an earlier draft. This work was 
partially supported by the COST Action MP1209. MF 
supported by the EPSRC and the Japan Society for the 
Promotion of Science, and DJ supported by the Royal 
Society. 


[1] F. G. S. L. Brandao, M. Horodecki, J. Oppenheim, J. M. 
Renes, and R. W. Spekkens, Physical Review Letters 
111, 250404 (2013), arXiv:1111.3882 

[2] F. Brandao, M. Horodecki, N. Ng, J. Oppenheim, and 
S. Wehner, Proceedings of the National Academy of Sci¬ 
ences 112, 3275 (2015), arXiv:1305.5278 

[3] P. Skrzypczyk, A. J. Short, and S. Popescu, “Extracting 
work from quantum systems,” (2013), arXiv:1302.2811 

[4] N. Linden, S. Popescu, and P. Skrzypczyk, Physical Re¬ 
view Letters 105, 130401 (2010) 


[5] N. Brunner, M. Huber, N. Linden, S. Popescu, R. Silva, 
and P. Skrzypczyk, Phys. Rev. E 89, 032115 (2014) 


[6] M. Navascues and S 

i. Popescu, Physical Review Letters 

112, 140502 (2014), 

arXiv:1211.2101 


[7] R. Gallego, J. Eisert, and H. Wilming, arXiv (2015) 
arXiv: 1504.05056 


[8] A. J. Roncaglia, F. Cerisola, and J. P. Paz, Physical 
Review Letters 113, 250601 (2014) 

[9] J. Aberg, Nature communications 4, 1925 (2013) 
arXiv:1110.6121 

[10] M. Horodecki and J. Oppenheim, Nature communica¬ 
tions 4, 2059 (2013), arXiv:1111.3834 

[11] O. C. O. Dahlsten, R. Renner, E. Rieper, and V. Vedral, 
New Journal of Physics 13, 053015 (2011) 

[12] M. F. Frenzel, D. Jennings, and T. Rudolph, Physical 
Review E 90, 052136 (2014) 

[13] T. Sagawa and M. Ueda, Physical Review Letters 100, 
080403 (2008), 

[14] S. Toyabe, T. Sagawa, M. Ueda, E. Muneyuki, and 
M. Sano, Nature Physics 6, 988 (2010) 

[15] Y. Ashida, K. Funo, Y. Murashita, and M. Ueda, Phys¬ 
ical Review E 90, 052125 (2014), arXiv: 1404.2388 

[16] P. Skrzypczyk, A. J. Short, and S. Popescu, Nature com¬ 
munications 5, 4185 (2014) 

[17] P. Kammerlander and J. Anders, arXiv (2015) 
arXiv:1502.02673 

[18] C. Jarzynski, H. T. Quan, and S. Rahav, arXiv (2015) 
arXiv:1507.05763 

[19] G. Gour, M. P. Muller, V. Narasimhachar, R. W. 
Spekkens, and N. Yunger Halpern, Physics Reports 583, 
1 (2015), arXiv:1309.6586 

[20] N. H. Y. Ng, L. Mancinska, C. Cirstoiu, J. Eisert, and 
S. Wehner, arXiv , 12 (2014), arXiv: 1405.3039 

[21] L. Nicolas, arXiv (2015), arXiv: 1506.02322vl 

[22] S. Wehner, M. M. Wilde, and M. P. Woods, arXiv 
(2015), arXiv:arXiv:1506.08145vl 

[23] M. Lostaglio, D. Jennings, and T. Rudolph, Nature 
Communications 6, 6383 (2015), arXiv:1405.2188 


[24] P. Cwiklinski, M. Studzinski, M. Horodecki, and J. Op¬ 
penheim, arXiv (2014), arXiv: 1405.5029 

[25] M. Lostaglio, K. Korzekwa, D. Jennings, and 
T. Rudolph, Physical Review X 5, 021001 (2015) 

[26] J. Aberg, Physical Review Letters 113, 150402 (2014) 

[27] K. Korzekwa, M. Lostaglio, J. Oppenheim, and D. Jen¬ 
nings, arXiv (2015), arXiv: 1506.07875vl 

[28] E. Bruton, History Of Clocks And Watches (Crescent, 
1988). 

[29] J. Salmilehto, P. Solinas, and M. Mottonen, Physical Re¬ 
view E - Statistical, Nonlinear, and Soft Matter Physics 
89, 1 (2014), arXiv: 1401.4440 

[30] A. S. L. Malabarba, A. J. Short, and P. Kammer¬ 
lander, New Journal of Physics 17, 045027 (2015) 
arXiv: 1412.1338 

[31] S. Rankovic, Y.-c. Liang, and R. Renner, arXiv (2015) 
arXiv: 1506.01373 

[32] D. Sels and M. Wouters, “The thermodynamics of time,” 
(2015), arXiv: 1501.05567 

[33] H. Tajima and M. Hayashi, arXiv (2015), 
arXiv: 1504.06150vl 

[34] E. P. Wigner, Zeitschrift fuer Physik 133, 101 (1952). 

[35] H. Araki and M. M. Yanase, Physical Review 120, 622 
(1960) 

[36] M. M. Yanase, Physical Review 123, 666 (1961) 

[37] M. Ahmadi, D. Jennings, and T. Rudolph, New Journal 
of Physics 15, 013057 (2013), arXiv: 1209.0921 

[38] I. Marvian and R. W. Spekkens, arXiv preprint 
arXiv:1212.3378 , 5 (2012), arXiv: 1212.3378 

[39] L. Loveridge and P. Busch, The European Physical Jour¬ 
nal D 62, 297 (2011), arXiv:1012.4362 

[40] L. Szilard, Zeitschrift fuer Physik 53, 840 (1929) 

[41] S. W. Kim, T. Sagawa, S. De Liberato, and M. Ueda, 
Physical Review Letters 106, 070401 (2011) 

[42] M. B. Plenio and V. Vitelli, Contemporary Physics 42, 
25 (2001), 

[43] R. Alicki, M. Horodecki, P. Horodecki, and R. Horodecki, 
Open Systems & Information Dynamics (OSID) 11, 205 
(2004), arXiv:0402012v2 

[44] L. del Rio, J. Aberg, R. Renner, O. Dahlsten, and V. Ve¬ 
dral, Nature 474, 61 (2011), arXiv: 1009.1630 

[45] J. Teifel and G. Mahler, Physical Review E - Statistical, 
Nonlinear, and Soft Matter Physics 83, 1 (2011) 

[46] H. Weimer, M. J. Henrich, F. Rempp, H. Schroder, 
and G. Mahler, Europhysics Letters 83, 5 (2007) 
arXiv:0708.2354 

[47] H. Schroder and G. Mahler, Physical Review E - Statis¬ 
tical, Nonlinear, and Soft Matter Physics 81, 1 (2010) 
arXiv:0911.5236 





10 


[48] P. Talkner, E. Lutz, and P. Hanggi, Phys. Rev. E 75, 
050102 (2007) 

[49] M. Campisi, P. Hanggi, and P. Talkner, Rev. Mod. Phys. 
83, 771 (2011) 

[50] J. C. Maxwell, Theory of Heat, Text-books of science 
(Longmans, Green, and Company, 1872). 

[51] J. Earman and J. D. Norton, Studies in History and Phi¬ 
losophy of Science Part B: Studies in History and Philos¬ 
ophy of Modern Physics 29, 435 (1998) 

[52] K. Maruyama, F. Nori, and V. Vedral, Reviews of Mod¬ 
ern Physics 81, 1 (2009) 

[53] W. H. Zurek, arXiv (2003), arXiv:0301076 [quant-ph] 

[54] A. Chapman and A. Miyake, arXiv , 30 (2015) 
arXivT 506.09207 

[55] N. Shiraishi, S. Ito, K. Kawaguchi, and T. Sagawa, New 
Journal of Physics 17, 045012 (2015), arXiv:1501.06071 

[56] J. Bub, Studies in History and Philosophy of Science Part 
B: Studies in History and Philosophy of Modern Physics 
32, 569 (2001), arXiv:0203017 [quant-ph] 

[57] D. Mandal and C. Jarzynski, Proceedings of the 
National Academy of Sciences 109, 11641 (2012) 
http://www.pnas.org/content/109/29/11641.full.pdf 

[58] D. Mandal, H. T. Quan, and C. Jarzynski, Phys. Rev. 
Lett. Ill, 030602 (2013) 

[59] R. Landauer, IBM Journal of Research and Development 
5, 183 (1961) 

[60] C. H. Bennett, Studies in History and Philosophy of Sci¬ 
ence Part B: Studies in History and Philosophy of Mod¬ 
ern Physics 34, 501 (2003) 

[61] S. Hilt, S. Shabbir, J. Anders, and E. Lutz, Physical 
Review E 83, 030102 (2011) 

[62] P. Faist, F. Dupuis, J. Oppenheim, and R. Renner, Na¬ 
ture Communications 6, 7669 (2015), arXiv:1211.1037 

[63] J. Goold, M. Paternostro, and K. Modi, Physical Review 
Letters 114, 5 (2015), arXiv: 1402.4499 

[64] K. Funo, Y. Watanabe, and M. LTeda, Physical Review 
E 88, 052121 (2013) 

[65] S. Ito and T. Sagawa, Physical Review Letters 111, 
180603 (2013), 

[66] J. M. Horowitz and J. M. R. Parrondo, EPL (Europhysics 
Letters) 95, 10005 (2011) 

[67] J. M. Horowitz and K. Jacobs, Physical Review E 89, 
042134 (2014) 

[68] C. Elouard, M. Richard, and A. Auffeves, New Journal 
of Physics 17, 055018 (2015), arXiv: 1309.5276 

[69] O. Arcizet, V. Jacques, a. Siria, P. Poncharal, P. Vin¬ 
cent, and S. Seidelin, Nature Physics 7, 879 (2011) 
arXiv:1112.1291 

[70] T. J. Kippenberg and K. J. Vahala, Science 321, 1172 
(2008), arXiv:1303.0733vl 

[71] I. Yeo, P.-L. de Assis, A. Gloppe, E. Dupont-Ferrier, 
P. Verlot, N. S. Malik, E. Dupuy, J. Claudon, J.-M. 
Gerard, A. Auffeves, G. Nogues, S. Seidelin, J.-p. Poizat, 
O. Arcizet, and M. Richard, Nature Nanotechnology 9, 
106 (2013), arXiv: 1306.4209 

[72] J. Sakurai and J. Napolitano, Modern Quantum Mechan¬ 
ics - Second Edition (Addison-Wesley, San Francisco, 
2010 ). 

[73] H. Carmichael, An Open Systems Approach to Quantum 
Optics, Lecture Notes in Physics Monographs, Vol. 18 
(Springer Berlin Heidelberg, Berlin, Heidelberg, 1993). 


Appendix A: Designing a machine with SU(2) 
Hamiltonians 

In the main text it was stated that choosing £f_ and 
H + to be SU(2) generators, finding the clock orbit via 
eq. 0 becomes equivalent to finding the initial state 
which optimises the coherence condition ( pTij ) . Using the 
definition C := i[H_,H + \ we can rewrite condition (Jin]) 
as 

tr[p M (0)C] > 0. (Al) 

This condition is optimised by choosing p M { 0) = | d) (d| 
where |d) is the maximum eigenvalue eigenstate of C. 
Now, since H _ and H + are SU(2) generators, the defini¬ 
tion of C implies that it is also a generator of SU(2), and 
one can think of H_, H + and C as representing three 
orthogonal axes. Since the reference clock-orbit evolu¬ 
tion is given by x(i) = exp [—iH-t] |d) (d\ exp[i£T_£], i.e. 
rotating a state that lies initially along the axis defined 
by C around the orthogonal axis defined by H_, we are 
guaranteed that for some time t = f, the state x(t) will 
coincide with the eigenstate of H + which optimises eq. 

0. 

Due to the symmetry properties of SU(2) (the gener¬ 
ators essentially defining orthogonal axes), condition 0 
is satisfied for any state |m) in the eigenbasis of C, not 
only the optimal state |d), since the expectation values 
of H + and with respect to the eigenstates of C are 
zero. Thus, since the clock orbits are defined with evolu¬ 
tion under U— only, and U— and H— commute, condition 
|ii]) is in fact satisfied at all times if the clock is on any of 
the reference orbits | m{t)). Thus the set of d clock orbits 
{\m(t))}i< m <d defines a zero energy surface with respect 
to the qubit’s | ip) state. 

Finally, since as noted above (m\H_\m) = 0 = 
(m\H + \m) for all to, and Hj = \{H + — .£/_), condition 
([I]) is also satisfied for all eigenstates of C. This condi¬ 
tion does however in general only hold at the initial time 
t = 0 and after a complete period t = t. The fact that 
for times 0 < t < t this condition is broken allows us 
to induce time-dependent level-splittings with respect to 
the clock orbits, while the fact that it holds for any com¬ 
plete period t = nr allows us to freely switch between the 
different |m) states and reset the engine to the reference 
orbit without any energy cost in case it was projected on 
a different orbit during the cycle. 

These considerations show that, while not necessary, it 
is very desirable to design the quantum thermal machine 
M in such a way that H± are SU(2) generators. 


Appendix B: The protocols in detail 

In this section we shall analyse the engine protocol 
outlined in the main text in more detail. The protocol 
starts with the joint system-machine system in the state 
p(0) = \ijj) {tj}\ <8>x(0). 
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The initialisation phase consists of free evolution of the 
initial state p(O) for a time f at which point the qubit’s 
induced level-splitting 

A(f) = tT[p M {t)H + ] (Bl) 

attains a maximum A max = A(t). This free evolution 
leaves the qubit state unchanged and rotates the clock 
state into x(f) = |d(f)) (d(f)|, the maximum eigenvalue 
eigenstate of H + . 

We now start the actual work extraction process which 
takes place from t < t < r and is broken down into 
N = subroutines of duration dt, which we call unit 
protocols (UP). These subroutines are further broken 
down into three steps as outlined in 

1. Thermalisation 

The first step of each UP is thermalisation of the qubit 
with respect to its local Hamiltonian H$(t ) induced by 
the state of the clock. Assuming that the clock is on the 
reference orbit at time t, the joint state after thermalisa¬ 
tion is 

P(t) = l{t) ® x(t), (B2) 

where ^(i) = po(t) |i p) (ip\+Pi(t) \ip) (ip\ is the Gibbs state 
with respect to the reference clock orbit at time t such 
that po(t) = (1 + e -/3A ^) -1 and pi(t) = 1 — po(t) and 
A(t) = (d(t)\H + \d(t)). 

If on the other hand the previous measurement pro¬ 
jected the clock onto a different orbit m and we keep the 
engine running (as is the case in the unselective proto¬ 
col), the qubit’s local mean-field Hamiltonian takes on 
a different form, giving it a different level-splitting and 
thus thermalising to a different Gibbs state q(t| to) = 
Pg n \t) | ip) ( ip\ +p < ] m \t) | ip) (ip |, where the thermal prob¬ 
abilities are defined as above but with respect to the 
level-splitting A(t\m) = (m(t)\H + \m(t)}, leading to a 
post-thermalisation joint state of 

p(t\m) = j(t\m) (g> | m(t)) (m(t) \ . (B3) 


Note that p(t\d) = p(t), but we make the distinction to 
keep notation in the case of the selective engine more 
concise. 


2. Evolution 

Step 0 of the UP consists of free evolution of the joint 
state for a duration dt. Note that for brevity we will in 
the following assume that all evolution operators act for 
a time dt, such that e.g. U- = U-(dt), unless stated 
otherwise. Under this evolution the state p(t) evolves 
into p'(t + dt) = Up(t)U^. More explicitly, 

p'(t + dt) = p 0 (t) | ip) (ip\ <g> U-x(t)Ul_ 

+ Pi(t) W (tp\ ® U + x(t)Ul- (B4) 
The first term corresponds to the clock simply evolving 
along the reference clock orbit for a time dt, since 
U-x(t)U^_ = x(t + dt). The second term however cor¬ 
responds to a deviation of the clock from the reference 
orbit, and an injection of energy into the clock. The 
expression for p'(t + dt\m) = Up(t\m)W has a similar 
form and interpretation, only with the reference clock 
orbit x(t) replaced by the co-rotating orbit | m(t)) (m(t)\. 


3. Measurement in the selective mode 

Finally in the last step 0 of the UP we perform a 
measurement to try and stabilise the clock and trans¬ 
fer the energy it acquired during step 0 to the macro¬ 
scopic measurement apparatus. The measurement that 
will project the clock back onto one of the clock or¬ 
bits is described by the projectors n m (f + dt) := 1® 
\m(t + dt)) (m(t + dt) \. Applying this to the state p'(t + 
dt), the post measurement state given measurement out¬ 
come m is 


pm(t + dt) 


Pd (t+dt) (PoW IV’) (4>\ +Pi(t)T dd (t,dt) | ip) (ip |) ®x(t + dt) if m = d 
| ip) (ip\ (g) |m(f + dt)) (m(t + dt) \ if m ^ d, 


with probability 


Pm (t, dt) 


Po(t) + Pi(t)T dd (t,dt) iim = d 

Pi(t)T md (t,dt) if m^d, 


(B5) 


(B6) 


where we have defined 

T m f m (t,dt) := \(m'(t + dt)\U+\m(t))\ 2 . 


(B7) 


This quantity r m ' m (f, dt) can be understood as the prob¬ 
ability of the clock transitioning into the orbit labelled by 
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m' under the deviation-inducing evolution U+ for dura¬ 
tion dt, given that the the clock was in the orbit labelled 
by m at time t. It can easily be verified that the r m / m 
form a doubly-stochastic matrix, with summation over 
either of the two indices giving unity. Also note that if 
dt is small, this matrix is diagonally dominant, i.e. the 
system is more likely to remain on any given orbit than 
to transition to a different orbit. 

From eq. (B 8 ) we can directly see that if we observe 
the measurement outcome m = d, the clock has been 
successfully stabilised and projected back onto the ref¬ 
erence clock orbit at y(f + dt). The qubit is steered 
to a slightly altered state p' s {t + dt) oc Po(t) \ip) (i/j\ + 
Pi(t)Y dd {t, dt) | ip) (4>\- On the other hand, if we observe 
to ^ d, the clock transitions to a different orbit (often 
leading to a backflow of energy from the measurement 
apparatus as we shall show below) and the qubit is in¬ 
stantly collapsed to \i/j). 

In the selective operation mode of the engine, such a 
“misfire” of the engine, a measurement outcome m ^ d , 
triggers an abortion of the current engine cycle and a 
feedback procedure that resets the engine for a new cycle. 
The qubit is immediately decouple from the bath to avoid 


further thermalisation. Further, we need to reset the 
qubit to the | ip) state to ensure that the clock rotates 
along the clock-orbit and ends up in a state where it can 
be restored to the reference orbit without any further 
energy cost. The flip operation is given by the unitary 
Uf := (| ip) (ip\ + \ip) (^|)81 and takes the state p m (t+dt) 
to p' m (t + dt) = \ip) (V>| 8 \m[t + dt)) ( m(t + dt )|. This flip 
is not energy conserving and its cost has to be taken into 
account (see below). Since the qubit is now in the \ip) 
state again, we can allow the system to freely evolve for 
a duration r — (t + dt), resulting in a state \ip) (ip | 8 
|to(t)) (to(t)|. Now, due to condition |i]), we can restore 
the clock to the reference orbit at %( 0 ) for free, and the 
joint system is ready to begin a new cycle. 


4. Measurement in the unselective mode 

In the case of the unselective operation mode the state 
after measurement depends both on the measurement 
outcome m! at time t + dt , as well as the the previous 
measurement outcome to at time t. Explicitly, the post¬ 
measurement state after observing m' is given by 


p(m'\m) 


p(rn' |m) (0 IV>) W + p[ m) (t)Y m > m (t, dt) \ip) (ip\) <8 \m'(t + dt)) (m'(t + dt) I 

\ip) (ip\ 8 \m'(t + dt)) (m'(t + dt) \ 


if m' — to 
if m' 7 ^ to, 


(B 8 ) 


with probability 


p(to/|to) 


Po n \t) +p[ m) {t)T mrn (t,dt) if m' = to. 
p[ m \t)T m ' m (t, dt) if m' ± to. 


(B9) 


If the engine is run in the unselective mode, the engine 
is kept running regardless of the specific outcome to', 
and the measurement is followed by the next UP, begin¬ 
ning with a new thermalisation to the state p(t + dt\m!) 
eq. (B31, which destroys the information of the previous 
measurement, resulting in a Markov process. 


Appendix C: Energy flows 

Knowing all the quantum states of the system S and 
machine M during the entire protocol it is straight for¬ 
ward to calculate the energy flows into the measurement 
apparatus and the thermal reservoir by invoking global 
energy conservation. 


1. Work 

The energy flow into the measurement apparatus takes 
place during the measurement process. It is given by 


the energy difference of the pre- and post-measurement 
quantum states. In the selective mode, given measure¬ 
ment outcome to we have for the energy flow into the 
macroscopic apparatus 


dE m (t,dt) = tr H SM (p'{t + dt) - p m (t + dt)) .(Cl) 


Substituting the explicit expressions for the states and 
invoking condition ([h]) we find for the energy flow in the 
case of the ideal measurement outcome m = d 


dE d (t, dt) =pi(t) (d(t)\AH + (t,dt)\d(t )), (C2) 

where we have defined 

AH + (t, dt) := H + - (C3) 

Since 0 < T d d.{t,dt) < 1, the ratio 

T dd (t, dt) _ _ 1 _ ^ 

p d (t,dt) px(t) +p 0 (t)/T dd (t,dt) 


is strictly less than 1. In addition the expectation value 
of H + is greater than the expectation value of U^_H + U- 
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in eq. ( |C2| , leading to a positive energy flow into the 
apparatuijj 

However, in the case of a misfire event, a measurement 
outcome m ^ d, we have a very different energy flow, 
specifically 

dE' m ^ d (t, dt) = Pl (t) (d(t)\H+\d(t)) 

— (m(t + dt)\H + \m(t + dt)). (C5) 

The dash indicates that this is only part of the energy 
flow associated with this event (see below). Even though 
the expectation value of H + with respect to the state 
| m(t + dt)) is generally less than that with respect to 
| d(t)), the fact that 0 < pi(t) < ^ implies that in many 
cases an engine misfire implies a back-flow of energy from 
the apparatus into the quantum systeirQ Additionally 
applying the feedback and flipping the qubit via the uni¬ 
tary Uf will lead to an additional energy cost (which we 
assume is taken from the work stored in the apparatus) 

dE^t, dt) = — {m(t + dt)\H + \m{t + dt )). (C6) 

Taking this feedback cost into account, the total energy 


exchange between quantum system and measurement ap¬ 
paratus given a measurement outcome to 7 ^ d is 

dE m ^ d (t,dt) = pi{t) (d(t)\H+\d(t)) 

— 2 (m(t + dt)\H + \m(t + dt)). (C7) 

Also taking into account the cost of resetting the memory 
W reae t(t, dt) = /3~ 1 S(p(t)) given in eq. (j9|, we arrive at 
the average work output of the UP starting at time t in 
the selective mode of 

dW(t, dt) = y^ y pm(t,dt)dE m (t,dt) - dW reset (t,dt)( C8) 

m 

We can split this up and define the work associated with 
outcome to as 

dW m (t, dt) := dE m (t , dt) - dW reS et(t , dt), (C9) 

such that dW = ’Yj m PmdW m . Using this, we find for the 
total work output of the selective engine averaged over a 
full engine cycle 


fc-i 


m= (n Pd(f + ndt ^ ^ dWd{r H- ndt ) 

n =1 n=l 

N—l k -1 

+ u[n Pdfr H - (1 — Pd{ t + kdt )) ^ ^ dWd{f + ndt ) 

k—2 71= 1 71=1 

N k -1 d -1 

E[n Pd {t ^ ^ Pmir “I - kdt)dW m (f + kdt ), 


k —1 7i= 1 


771=1 


(CIO) 


where the first term corresponds to the work extracted 
in the case each of the N unit protocols succeeding, the 
second term corresponds to the work extracted up to a 
misfire at the k -th UP (which also aborts the cycle), and 
the final term contains the energy flow of the misfire at 
the fc-th UP itself, all weighted by the respective proba¬ 
bilities of these events occurring. 

For the unselective engine mode, we find the energy 
associated with with a transition from the clock orbit m 
at time t to the orbit m! at time t + dt (c.f. eq. ©) as 


dE(m!\m) = p™(t) ( (ra|if+|ra) 


r , 

A 771/777. 

p(m'\m) 


{m'\H + \m')), (Cll) 


6 Note that this is not strictly true for every UP in general if the 
clock evolves along a complicated trajectory (i.e. if the level 
splitting A(t) is not monotonically decreasing), but is always 
true on average over the interval f < t < r. 

7 Note that again 0 < Pi(t) < ^ is not necessarily true in gen¬ 
eral for complex clock evolution since A(t) can in principle get 
negative, but it holds on average over the interval f < t < r. 


where we have omitted the explicit time dependence for 
notational brevity. 

The probability of a complete engine trajectory m = 
{raf, TTir+dt, over all N unit protocols is 

N 

P{ m) = p(m?) U p(mf+ndt\mf + ( n -i)dt), (C12) 

77=1 

where the inclusion of rrif in the trajectory allows for the 
scenario in which the clock begins the protocol on a orbit 
other than the reference orbit. The total energy flow over 
this trajectory is 

N 

E(m) = £ dE{mf + ndt\m f+{n -i)dt)- (C13) 

71=1 

Finally at the end of the engine cycle we have to reset the 
memory which is associated with a work cost W reset = 
/3 -1 S({p(m)}), such that the total average work output 
of the unselective engine is given by 

(W) u = J^p(m)F(m) - /3~ 1 S({p(m)}). (C14) 
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where we have averaged over all possible engine trajec¬ 
tories. 


2. Heat 


During step 0 of each UP the qubit thermalises with 
respect to its induced local Hamiltonian by interacting 
with the thermal reservoir at inverse temperature /3. Be¬ 
fore the thermalisation at time t + dt the qubit is in the 
state 


p' s {t + dt) — 


Po(t) 


H) <V>I 


Pd{t, dt) 

= P*0 W (V’l +P*1 




p d (t,dt) 


(CIS) 


as can be seen from eq. (B8). Since in general po(t+dt) < 
Po{t) and pi (t + dt) > pi (t) (since A(t + dt) < A(t)), and 
Po > Po(t) and p\ < pi(t), we see that the interaction 
of the qubit with the machine, and the back-action of 
the measurement process on the machine actually drive 
the qubit even further away from its thermal state at 
time t + dt than it would have been otherwise, leading to 
an increased heat flow required to thermalise the qubit. 
Specifically the heat flow during the thermalisation at 
time t + dt is given by 


dQ(t + dt) = tr HsM{p{t + dt) - pd(t + dt)) 

T dd (t,dt) 


= pi(t + dt) ~pi{t) 


Pd{t , dt) 


(C16) 
A (t + dt). 


The heat flow in the classical case is given by the same 
expression but without the ratio Tdd{t,dt)/pd(t,dt) < 1, 
showing that the fully-quantum mechanical protocol has 
a higher heat flow associated with it. However, as we hall 
show below, this ratio approaches 1 in the Zeno limit, 
such the the classical result can be recovered even for 
finite-size quantum machines, as long as they can be sta¬ 
bilised infinitely fast. A similar result can be derived for 
the unselective mode of operation. 


Appendix D: Zeno Limit derivations 


In this section we derive the results for the Zeno limit 
dt —> 0, in which the clock is stabilised with infinite fi¬ 
delity, and energy flows constantly out of the quantum 
machine into the measurement apparatus. 

Let us first consider the quantity T m t m defined in eq. 


(B7). We can rewrite it as 


Pm'm(i, dt) = \(m'{t)\e lH ~ dt e lH+dt \m(t))\ . 


(Dl) 


Expanding the exponential functions ignoring terms of 
order 0{dt 2 ) or higher we have 

r m r m(t,dt) ~ \(m'(t)\(± + i.H_dt)(t — iH + dt)\m(t))\ 2 
~ \5m'm + i (m'(t)\H _ - H + \m(t)) dt\ 2 
= 5m'm - 0(dt 2 ). (D2) 


Thus the probability of the clock staying on a certain 
orbit m, despite the deviating inducing evolution gen¬ 
erated by H +1 is equal to unity up to first order in 
dt. Crucially this includes the clock orbit d such that 
T d d{t,dt) = 1 - 0(dt 2 ). 

Having established the limiting value of this central 
quantity, we can now consider the probability distribu¬ 
tion over measurement outcomes p(t). Specifically, we 
are interested in the probability associated with the clock 
being projected back onto it’s reference orbit, assuming it 
star ted t he UP in this orbit, i.e. the probabili ty P d{t, dt) 
eq. ( |B6| ) in the selective case, and p(d\d) eq. ( |B9[ ) in the 
unselective case. We first note that pd{t,dt) = p{d\d). 
We have 


Pd(t, dt) = p 0 (t) + T dd (t, dt)pi(t) 

= Po{t)+Pi(t) - 0(dt 2 ) 

= 1 - 0(dt 2 ) (D3) 

Thus in the Zeno limit we are guaranteed to project back 
onto the reference orbit and the selective and unselective 
protocols become equivalent. From this result it also im¬ 
mediately follows that the cost of resetting the memory 
vanishes for both operation modes as W reset = 0(dt 2 ). 
Thus from eq. |Io| ) we have 

dW(t,dt) = dE d (t,dt) — 0(dt 2 ). (D4) 


To find an expression for dEd(t,dt) in the dt 0 limit 
we see from eqs. (C21 and ( |C3| ) that we need to evaluate 
the expression T ^n^t) H+ U- ■ For the ratio we find 
T pd(t dt) = 1 — 0(dt 2 ) as can be verified by substituting 
eqs. (|D2|) and (|D3|). For the other factor we have upon 


expanding the exponential functions 


t/lff+tU = (1 + iH_dt)H + { 1 - iH_dt) + 0(dt 2 ) 

= H+- iH+H_dt + iH_H+dt + 0(dt 2 ) 

= H+ + i[H_,H+\dt + 0(dt 2 ) (D5) 


from which it follows that 


AH + (t,dt) = —i[H-,H + ]dt + 0(dt 2 ). 


(D6) 


Finally, substituting this expression into eq. (C2) and the 


result into eq. (D4) we arrive at the result for dW ( t , dt) 
eq. 


Let us now focus on the free energy change of the qubit 
to prove eq. (14), showing that in the Zeno limit the 


entire free energy difference can be extracted as work. 
Given the fact that we fixed the qubit’s | ip) state to zero 
energy via condition ([il]) , the partition function at time t 
is given by Z{t) = 1 + where A (t) = tr[x(i)-ff+], 

where we have assumed that the clock is on the clock 
orbit \(t) as we showed is always the case in the Zeno 
limit. The change in free energy of the qubit is thus 


dF(t) =d(-/3~ 1 log Z{t)) 
1 dZ 


= -/? 


-l 


Z{t) dt 


dt. 


(D7) 
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Differentiating the partition function with respect to t we 
get 


dZ 

dt 


at 


(D 8 ) 


The energy splitting of the qubit varies in time as 


dA 

dt 


Jt tlbcm+] 
±i X [U-(t) X (0)U'_(t)H + ] 
*tr H + ]\. 


(D9) 


Finally, recognising that e = Pi(t) and substituting 
everything into eq. (D71, we see that dF(t) = —dW ( t , dt) 
up to first order in dt, which concludes the proof of eq. 

(Ill- 

Integrating eq. (D7| over the interval t < t < r and 
noting that Z(t) = 2 due to the qubit’s degeneracy at 
t = t, e q. (15) follows immediately. 


Appendix E: Detailed analysis of the spin-clock 


As outlined in the main text, for the specific example 
of a spin -1 system acting as quantum machine/clock we 
choose the Hamiltonian 


Hsm 


1 

71 


(<7 (g> L z + 1 (g> Ly ), 


(El) 


such that H± = ± L z ), where L & is the angular 

momentum operator of the spin -1 particle along the k- 
axis. The angular momentum operators L^ clearly are 
SU(2) generators, and the operators H± can also be seen 
as angular momentum operators defining a new coordi¬ 
nate system. The third SU(2) generator can be found via 


the commutation relation 


C = i[H_,H+\ = -L x . (E2) 

Thus we see that this new coordinate system has essen¬ 
tially been flipped along the s-axis as well as rotated in 
the y-z-plane. The ideal clock state we want to choose is 
the maximum eigenvalue eigenstate of C , i.e. a spin fully 
polarised along the negative .T-direction. We define the 
eigenbasis such that C\m) = m\m). Note 

that here we use a slightly different convention from the 
remainder of the text where 1 < m < d. This is more 
suited to the angular momentum eigenbases. The state 
on the reference clock orbit at time t is thus given by 

X (t) = U-(t)\l)(l\uUt). (E3) 

Considering the rotation generated by U- (or using the 
more formal Wigner matrix derivation introduced be¬ 
low), it is straightforward to show that the level splitting 
of the qubit induced by a clock on this orbit is simply 


A(t)=/sinf. (E4) 

We see that the maximum level splitting that this clock 
can induce is A max = l at a time f = tt/2. Note also that 
even though the period of the clock is technically r = 2 - 7 T, 
it is preferable to stop the machine earlier at t' = 7r, since 
the qubit is degenerate again at this time with respect 
to all clock orbits, and keeping the engine running for 
the remaining period would at best (namely in the Zeno 
limit) lead to no additional work gain. Even though this 
was not explicitly stated in the main text, whenever the 
qubit is degenerate with respect to all clock orbits for 
some time t = r' < r it is preferable to stop the engine 
there and “run the engine in neutral” for an additional 
time t — t' to get back to the original setup. 

For the spin-clock we can explicitly evaluate the quan¬ 
tity r m / m defined in eq. (B7). Starting from the defini¬ 
tion we have 


r m 'm(t,dt) = \{m'(t + dt)\U + (dt)\m(t))f 


(m'\e lH - {t+dt) e- iH+(dt) e- lH - {t) \m) 


which looks very similar to the Wigner D-matrix , 72] 


D m ' m (a,/3, 7 ) := (m z |e 


/_/ 1 „—iL z ot ^—iL.,/3 „—iL z -y\ 

• — / m \& z & V " & z 1 \ 


m z ) 


= e -<("*'“+">7) dm/m ( / 3) 


(E5) 


(E 6 ) 


where the Wigner d-matrix d m ' rn (/d) is defined as 


J (OS_ I (l + m)\(l-m)\ yS ( l + m' \ (l - m'\ , 0, 2 l-m-m'- 2 * * (cn „P)m+m'+ 2 v 

~ V ^ V l + m-a){ a ) ( ' 1] (Sm 2 } (C ° S 2 j ' (E?) 


In (E 6 ) the states \m z ) and | m' z ) are eigenstates of the L z operator. However, in eq. (|E5|) the states are eigenstates 


of C, not H _, we thus have to introduce two identity decomposition in the F/_ basis {|)} to make use of eq. (E 6 |. 
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We have 


^rn'm (Z, dt) 


Y (m'\k '_) {k'_\e iH -^ t+dt) e- iH +^e- iH -^\k_) (k_\m) 

k— ,k'_ ——l 

2 

l 

D k f k (~{t + dt),dt,t) ( m'\k'_) (k-\m) 

k,k'=-l 

2 

l 

Y e-^ kt - k ^ t+dt) U k r k (dt){m'\k'_){k_\rn) . 
k,k'=-l 


(E8) 


Finally, noting that | to) = 
(k-\e~ lH + w / 2 \m-) = D km ( 0, 


e lH +’ n '/ 2 |m_) and using the D-matrix result eq. 
f ,0) = d km { f) and similarly (ro'|fc'_) = d m 'k '(-§)• 


(E6) again we have 
Hence we arrive at 


(fc_|?7l) = 


^rn'm (t, dt) 


]T e _i(fct-fe' (*+«**)) 4 , fe (di)^,*, (- | )d fcm ( \ ) 


(E9) 


This might not look more illuminating than the original 
expression, but the d-matrices are well known functions 
and can easily be evaluated computationally, allowing us 
to explicitly calculate results for the spin-clock example. 
We used this expression to generate the results shown 
in Figures [2] and [3] All other results follow by simply 
substituting this result into the relevant expressions. 

Let us conclude the analysis of the spin-clock example 
with the work output in the Zeno limit. Starting from 
A (t) = l sint we have 

dF{t) = pi(t)l cos(t)dt (E10) 

which upon integration from ^ < t < n yields 

Wzeno = kT (log 2 - log(l + e~ 01 )) . (Ell) 

This is the maximum work that can be extracted from a 
pure qubit if one is limited to utilise a spin-Z system as 
a clock/machine and a thermal reservoir at inverse tem¬ 
perature /3 = 1 /kT. We see that the the semi-classical 
W = kT log 2 can only be recovered for infinitely large 
clocks Z —> oo (or for zero temperature /3 —> oo). 


Appendix F: Simulating different thermalisation 
regimes 

As noted in the main text, to obtain the analytic re¬ 
sults we have to make the assumption that the thermal¬ 
isation of the qubit takes place at the beginning of each 
unit protocol, right after the measurement of the pre¬ 
ceding unit protocol. Thus evolution 0 and thermal¬ 
isation 0 are in a sense non-interacting, separated by 
the measurement (Jc]) . In this section we present simu¬ 
lation results that do not rely on this assumption but 


instead model non-trivial interactions between thermali¬ 
sation and evolution, and hence show that the approxi¬ 
mation is qualitatively robust in all the regimes consid¬ 
ered. 

We consider two ways of avoiding the assumption. In 
the first one, we simply split each unit protocol further 
into np sub-units of duration dt' = dt/np , each consisting 
of thermalisation of the qubit followed by free evolution 
of the joint system for duration dt'. The measurement is 
still only performed once per UP, at the end. Note that 
whereas in the main text the qubit always thermalises 
with respect to the local Hamiltonian induced by the ref¬ 
erence clock orbit x(f), the thermalisations in between 
t and t + dt are with respect to the local Hamiltonian 
induced by the clocks momentary state, which in gen¬ 
eral deviates from the reference orbit between measure¬ 
ments. The more sub-units np we consider, the more 
quasi-static the process becomes as the qubit more and 
more smoothly transitions from one thermal state to the 
next. Due to the immediate backaction into the clock 
by the joint evolution, this can be seen as the machine 
scanning the thermal distribution with a higher fidelity. 
In the limit np oo the process becomes a quasi-static 
equilibrium process and the work output is maximised. 
The results for the optimal scenario in which each mea¬ 
surement succeeds are shown in Fig. [4] for a spin-Z clock 
with stabilisation interval dt = 0.05 for different numbers 
of sub-units np. Note that the analytic result (which is 
equivalent to np = 1 in the simulations) is not necessarily 
more or less realistic than the results for higher np > 1, 
but can be seen as a non-equilibrium result similar to a 
finite thermalisation time of the qubit. 

To consider an even more realistic model of non¬ 
equilibrium behaviour, we can further introduce the no¬ 
tion that during each thermalisation stage the qubit is 
not instantaneously transformed into a Gibbs state, but 
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FIG. 4. Ideal work output of a machine with a spin-Z clock 
and stabilisation interval dt = 0.05, plotted against clock size Z 
for different numbers of thermalisation np per unit protocol. 
The np = 1 result is in exact agreement with the analytic 
result for dt = 0.05 in Fig. [2] The higher the value of np, the 
more continuous the state transformation of the qubit, i.e. 
the more quasi-static the protocol, leading to greater work 
output. The dashed line shows the Zeno limit. 


FIG. 5. Ideal work output of a machine with a spin-Z clock 
and stabilisation interval dt = 0.05, plotted against clock size 
Z for np = 5 finite rate equilibrations per unit protocol with 
different effective equilibration times rp. The analytic result 
with np = 1 and infinite equilibration rate rp is shown for 
comparison. Faster equilibration (larger rp) implies an evo¬ 
lution of the qubit closer to equilibrium and thus larger work 
output. Note that for large Z the qubit’s upper level drops 
faster, so that the qubit is further from equilibrium for the 
same equilibration rate, leading to reduced work output for 
large Z. 


instead undergoes an equilibration with a bosonic bath 
at a finite rate, evolving according to a standard master 
equation m- The mean bosonic occupation number n 
is given by 


1 

“ eP u - 1 


(FI) 


for a mode of frequency w, and we assume that at any 
time t the qubit only couples to the mode which it is 
in resonance with, i.e. for which u = A (t). We further 
define the clock’s states with respect to the qubit’s | ip) 
and \ip) states as 


Pm '■= Wpsm^) (F2) 

Pm : = W’IpsmIV’) (F3) 


respectively. Using this notation it can be shown that 
equilibration of the qubit with a bosonic bath for an ef¬ 
fective duration rp takes the joint qubit-machine state 
Psm to 

p'sm « I ip) (V>l 8 ( 

+ ww(F4) 


where the C x ^ y transition coefficients are given by 


Cip—yijj 
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(F5) 

(F6) 

(F7) 
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In the limit rp —t oo this model of equilibration corre¬ 
sponds to the instantaneous transformation to the Gibbs 
state considered above, but for finite rp the reduced state 
of the qubit will in general “lag behind” the Gibbs state. 
Using this finite time equilibration in combination with 
the previous notion of breaking each UP into multiple 
sub-units of equilibration and evolution before the fi¬ 
nal measurement allows us to model very realistic non¬ 
equilibrium behaviour. Figure [5] shows the results for 
the spin-clock model for np = 5 equilibration events and 
different effective thermalisation times Tp. The analytic 
result corresponding to np = 1 and rp —> oo is also shown 
for comparison. 

We see that for very large l the work output of the ma¬ 
chine is drastically reduced and approaches zero in the 
limit of large l. This is due to the fact that in this specific 
model the level splitting of the qubit is A (f) = l sinf, and 
thus the larger l, the faster the level splitting changes, 
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requiring longer effective equilibration times Tp to keep 
the qubit close to its respective Gibbs state. This is in 
some sense equivalent to the notion encountered in the 
analytic results in the main text that a larger clock (i.e. 
faster change in level splitting) requires the machine to 
sample the qubit’s thermal distribution at a higher rate 
in order to get a good work output. We again clearly 
see the tradeoff between maximising work output and 
maximising power. If we want to achieve an optimal 
work output we have to slow down the system dynamics 
(which in effect increases the ratio of effective equilibra¬ 
tion time Tp to the change in level splitting ^), which in 
turn reduces the power output of our engine. Conversely, 
increasing the power by increasing the rate of system dy¬ 
namics we end up further away from the Zeno limit where 
we can convert the entire free energy difference of the 
qubit into work, thus sacrificing potential work output. 
Even though the exact quantitative result strongly de¬ 
pends on the model parameters, we see from the results of 
this section that the analytic result based on the assump¬ 
tion of a single instantaneous thermalisation during each 
UP qualitatively contains all the core features and even 
quantitatively accurately captures the results for certain 
realistic thermalisation regimes of non-equilibrium dy¬ 
namics. Particularly as we approach the Zeno limit all 
results exactly converge, if we assume a very strong cou¬ 
pling between qubit and bath, such that the qubit always 
remains (approximately) in thermal equilibrium. 

Appendix G: Fuelling the engine with mixed states 


If we feed this state into the engine, note that the initial 
stage for times 0 < t < f is now not just the trivial 
level splitting induction in the qubit anymore, but also 
leads to a deviation of the qubit from it’s clock orbit. 
Thus we need to introduce one additional measurement 
before the first thermalisation to project the clock back 
onto its reference orbiij^J The probability P*4) of this 
measurement failing in projecting the clock back onto 


x(t) is given by 


d -1 


p(1) = ^E |(w('f)|U+(f)|<i )| 2 

(G3) 

d -1 


m—1 

(G4) 

= |(i-r dd (o,f)). 

(G5) 


If this event occurs, the engine immediately starts a feed¬ 
back process returning the qubit in the \ip) {ip | at the end 
of the cycle. Otherwise, since the qubit is now at time 
t = t back on its reference orbit and the qubit gets ther- 
malised just as if it would have on input of the ideal 
state |V>) {ip |, the machine proceeds for the remainder of 
the cycle t < t < t just as in the original protocol. As 
noted above, this second part, containing the actual work 
extraction, has a misfire and feedback event with a prob¬ 
ability 

N 

P {2) = l-\\_P d {f+ ndt). (G6) 

n —1 


In this section we consider a slight variation of the se¬ 
lective engine mode discussed in the main text and show 
how the engine behaves if instead of pure states we try 
and fuel it with (partially) mixed states. If we consider 
the machine as a black-box system which takes some state 
ps as an input, generates work, and outputs a new state 
p* s , we can say that if given the ideal input ps = \ip) {ip\ 
for which the engine was designed, we get the state 

Ps = q \ + (! - 9) IV/ (V’l (Gl) 


as the output, where q = nLi Pdf + ndt ) is the proba¬ 
bility that all N unit protocols succeed an we finish the 
cycle with a maximally mixed qubit in the state 1/2, and 
(1 — q ) is the probability that some UP fails and we per¬ 
form the feedback process which returns the qubit to the 
original state \ip) {ip\- The work output asso ciated with 
this state transformation ps —► p* s is (W) eq. (CIOI. The 
machine itself is unchanged and will by the design of the 
protocol always be in the state y(f) after outputting p* s , 
thus effectively acting as a catalyst. 

But we can also ask what happens if instead of in¬ 
putting the pure state ps into the engine, we try and 
feed the engine its own output state, the mixed p* s . We 
can rewrite the state as 


P*s = ( l-pw 


| IV/ (V>l • 


(G2) 


Putting both parts together, the chance of the machine 
experiencing feedback entering the feedback procedure at 
any point and thus returning the pure state | ip) is 

p(|^» = P (1 > + (l-P (1) )P (2 >, (G7) 

whereas the engine completes the full cycle and outputs 
the maximally mixed state 1/2 with probability 

P(f) = (1 - J P (1) )(1 - -P (2) )- (G8) 

This allows us to ask for which q = q* the machine out¬ 
puts the same state that it got as its input. This condi¬ 
tion is met when q* = p{ |), i.e. for 

/ * \ N 

q * = ( 1 - y (! - Tdd(0, f)) J p d {? + ndt). (G9) 

' ' n= 1 

In general this quantity depends on the specific machine, 
but we can explicitly evaluate it in the case of the spin 


Note that the introduction of this additional measurement in 
the original protocol, which assumes pure state inputs, would 
be trivial since in that scenario the state commutes with the 
measurement. 
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clock in the infinite size limit d —> oo. In this case we 
have T dd (0, f) 0 and II^=i Pdij + ndt) —>• 1 such that 
after rearranging we find q* = |. 

One might intuitively expect that if the engine returns 
the same state that it got as an input, that at best it 
has a zero net work output. By explicitly calculating the 
relevant expressions it can easily be shown that for the 
stationary state with q = 2/3 in the limit of a machine 
with d — > oo, the energy transferred to the measurement 
apparatus is A E = |fcTlog2, whereas the resetting cost 
of the memory is W reset > kT( log 3 — | log 2) such that 
A E—W reae t = kT( | log 2—log 3) < 0. Thus the net work 
output in this scenario is strictly negative. More realistic 
machines with finite d have even lower work output. This 
result should not be very surprising, since a state a state 
with q = 2/3 is closer to being maximally than being 
pure. 

Instead of asking which state is stationary under the 
action of the machine, we can also ask which state leads 
to a zero net work output, such that all states more 
pure than this state would result in positive work. We 
know that such a state has to be less mixed than the 
stationary state, i.e. have q < q*. The exact value 
will again strongly depend on the specific clock/machine 
used, but we can once more consider the classical equi¬ 
librium limit of an infinitely large clock d —> oo and 
the qubit being kept in equilibrium with the bath. As¬ 
suming the engine gets that far, the actual work extrac¬ 


tion stage in this limit always succeeds, outputting an 
amount kT log 2 of work. Thus it all comes down to 
the probability P*- 1 -* of the first stabilising measurement 
at t = t failing or succeeding. The measurement it¬ 
self can easily be shown to induce a zero energy change 
on average (although each individual measurement re¬ 
sult has different energy flows associated with it). Hence 
the total average energy transferred by the engine is 
A E = (1 — P^)kT log2 which in this specific limit is 
equivalent to ATI = (1 — |)TT log 2. The memory erasure 
cost during the actual work extraction stage t < t < r 
vanishes in this limit, so the only erasure cost required is 
the one of the initial measurement at t = t which is given 
by W reset = kTS({PW,l- PW}) = kTS({ f, 1 - §}). 
Hence the state with zero energy output has q = q' which 
satisfies 

(1 - log2 = - q - log | - (1 - q ~) log(l ~ q ~). (G10) 

This equation can be solved numerically to yield q' « 
0.454. For any q < q' the machine has a net positive 
work output, whereas for more mixed states with q > 
q' the work output is negative. For realistic machines 
away from the infinite d and perfect thermalisation limit 
we require even smaller q (i.e. more pure states) for a 
positive work output. 



